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Spotting Trees with Few Leaves* * * § 

Andreas Bjorklund! Vikram Karnatf Lukasz Kowalik^ Meirav Zehavi^ 


Abstract 

We show two results related to the Hamiltonicity and /c-Path algorithms in undi¬ 
rected graphs by Bjorklund [FOCS’lO], and Bjorklund et al., [arXiv’10]. First, we demon¬ 
strate that the technique used can be generalized to finding some k -vertex tree with l leaves 
in an n-vertex undirected graph in 0*(1.657 fc 2k 2 ) time. It can be applied as a subroutine 
to solve the ^-Internal Spanning Tree (fc-IST) problem in 0*(min(3.455 fe , 1.946 n )) 
time using polynomial space, improving upon previous algorithms for this problem. In 
particular, for the first time we break the natural barrier of 0*( 2"). Second, we show that 
the iterated random bipartition employed by the algorithm can be improved whenever the 
host graph admits a vertex coloring with few colors; it can be an ordinary proper vertex 
coloring, a fractional vertex coloring, or a vector coloring. In effect, we show improved 
bounds for Zc-Path and Hamiltonicity in any graph of maximum degree A = 4,..., 12 
or with vector chromatic number at most 8. 


1 Introduction 

In this paper we study the (. k , Z)-Tree problem. Given an undirected host graph G on n 
vertices, (. k , Z)-Tree asks if G contains a tree T on k vertices, such that the number of leaves 
in T is exactly l. This problem is a natural generalization of the classic fc-PATH problem: for 
l = 2, the definitions of (k, Z)-Tree and A;-Path coincide. For k = n and l = 2 we get the 
classic Hamiltonian Path. Furthermore, (. k , Z)-Tree is tightly linked to the well-studied 
A;-Internal Spanning Tree (fc-IST) problem, which asks if a given graph G on n vertices 
contains a spanning tree T with at least k internal vertices. Indeed, it is well-known that a 
yes-instance of fc-IST is a yes-instance of (k + l, Z)-Tree for some l < k, and vice versa (see 
Section [3]). 

Because of the connections to Hamiltonian Path, the (k, Z)-Tree, fc-IST and /c-Path 
problems, even in bipartite graphs or in graphs of bounded degree 3, are NP-hard. In this 
paper we study parameterized algorithms, i.e., algorithms that attempt to solve NP-hard 
problems by confining the combinatorial explosion to a parameter k. More precisely, a problem 
is fixed-parameter tractable (FPT) with respect to a parameter k if it can be solved in time 
0*(f(k )) for some function /, where O* hides factors polynomial in the input size. Our 
results have also consequences in the field of moderately exponential-time algorithms, where 
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Reference 

Det./Rand. 

Space 

Graph G 

Running Time 

Prieto et al. [37] 

det 

poly 

Undirected 

Q*(2°l k i°g fc )) 

Gutin al. [25J] 

det 

exp 

Directed 

Q*(20{klogk)'j 

Cohen et al. [8] 

det 

exp 

Directed 

0*( 55.8 k ) 


rand 

poly 

Directed 

0*(49.4 fc ) 

Fomin et al. [P)| 

det 

exp 

Directed 

(3*(16 fc +°( fe )) 


rand 

poly 

Directed 

(3*(16 fc +°( fe )) 

Fomin et al. [18] 

det 

poly 

Undirected 

lr {¥) 

Shachnai et al. [42] 

det 

exp 

Directed 

0*(6.855 fe ) 

Daligault [TO]. Zehavi [44] 

rand 

poly 

Directed 

0*1 4 s ) 

Li et al. 311 

det 

poly 

Undirected 

0*(4 fe ) 

Zehavi [45] 

det 

exp 

Directed 

0*(5.139 fe ) 


rand 

exp 

Directed 

0*(3.617 fc ) 

This paper 

rand 

poly 

Undirected 

0*(3.455 k ) 


Table 1: Known FPT algorithms for A;-1ST. 


one aims at providing an 0(c n )-time algorithm, with the constant c > 1 being as small as 
possible. 

We develop an FPT algorithm for (k, /)-Tree in general graphs that relies upon a non¬ 
trivial generalization of the technique underlying the Hamiltonicity and /c-Path algorithms 
in [2UU- As a result, we break the natural barrier of 0*{2 n ) in the running time bound for the 
fc-IST problem. Then, we conduct a thorough examination of the (k , £)-Tree problem in spe¬ 
cial classes of graphs that admit a vertex coloring with few colors. This in turn implies faster 
algoritms for Hamiltonicity, UPath and fc-IST in bounded degree graphs. Apart from 
the classic vertex coloring, we consider fractional coloring and vector coloring, thus showing 
that the latter tool, famous in approximation algorithms, is also helpful in parameterized 
complexity. 

1.1 Related Work 

The A;-IST problem, along with its directed version known as /c-Internal Out-Branching 
(fc-IOB0), are of interest in database systems m and water supply network design [38] . 
Note that any algorithm for /c-IOB works also for /c-IST (after replacing every edge of the 
input graph by two oppositely oriented arcs). Over the last 10 years fc-IST and fc-IOB were 
heavily researched, resulting in a number of algorithms using a variety of approaches. We 
present the progress for FPT algorithms in Table [1] For the special case of graphs of bounded 
degree A, Zehavi [ 41 ] shows an algorithm for fc-IOB running in time 0*(4 l 2 A(a-iR An¬ 
other specialized algorithm, due to Raible et al. [38] . solves /c-IST in graphs of maximum 
degree 3 in time 0*( 2.137 fc ). There also has been quite some interest in moderately expo¬ 
nential time algorithms for fc-IST. Unlike in many other graph problems, even an 0*(2 n ) 
algorithm is not completely trivial. Algorithms achieving this bound were shown by Raible 
et al. [38] (with exponential space) and Nederlof [36] (polynomial space). Note that 0*(2 n ) 

1 In fc-IOB problem, we need to decide if a directed graph G contains a spanning tree T with exactly one 
vertex of in-degree 0 and at least k internal vertices. 
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is a natural barrier, as it corresponds to the number of all subsets of vertices. During recent 
years researchers managed to surpass this barrier for a number of problems, like Dominat¬ 
ing Set by Grandoni [231, Feedback Vertex Set by Razgon [39] and Hamiltonicity 
by Bjorklund [2j. Raible et al. [38] explicitly posed an open question asking whether the 
approach of Bjorklund |2] can be extended to fc-IST either for general graphs or for graphs 
of large vertex cover. Raible et al. [38] were able to cross the 0*(2 n ) barrier for graphs of 
bounded degree — they get an algorithm running in time 0*(( 2 A+1 — 1) a- 1 ) and present a 
further improvement to O* (1.862™) when A = 3. 

The (fc, Z)-Tree problem was implicitly introduced by Cohen et al. [3] as a tool for solv¬ 
ing /c-IOB. They obtained a 0*(6.14 fc )-time algorithm. Currently, the fastest randomized 
algorithms, due to Daligault [10] and Zehavi [33], run in time 0*{2 k ) and have a polynomial 
space complexity. Note that we meet the 0*{2 k ) barrier again. For the deterministic case, 
Zehavi [35] (building upon [ 32 ]) shows a 0*(2.597 fc )-time algorithm for (fc, Z)-Tree with an 
exponential space complexity. 

The fundamental A;- Path problem is well-studied in the field of Parameterized Complexity. 
In the past three decades, it enjoyed a race towards obtaining the fastest FPT algorithm 
(see mmmm ECO ED, GO 133 ., m B2D. Currently, the best randomized algorithm, due to 
Bjorklund et al. [3], runs in time 0*(1.657 fc ) and has a polynomial space complexity, and 
the best deterministic algorithm, due to Zehavi [45] runs in time 0*(2.597 fc ) and has an 
exponential space complexity. 

The result in [3] extends Bjorklund’s 0*(1.657 n ) time algorithm for Hamiltonicity in [2]. 
The same paper contains also a better bound of 0*(2 n / 2 ) for bipartite graphs. Hamiltonic¬ 
ity in graphs of bounded maximum degree A received a considerable attention beginning with 
the paper of Eppstein [13], followed by works of Iwama and Nakashima [28] , Gebauer [22] 
and Bjorklund et al. [3]- Currently the best algorithm for A = 3, due to Cygan et al. [9], 
runs in time 0*( 1.201™) [9], while prior to this work for A > 4 the best bound was that of 
the general Bjorklund’s 0*(1.657 n )-time algorithm. 

1.2 Our Contribution 

We obtain improved algorithms for ( k , £)-Tree, /c-IST, A;- Path and Hamiltonicity. All of 
our algorithms are randomized Monte-Carlo and have polynomial space complexities. Our 
contribution is twofold. 

From Paths to Trees. First, we develop an algorithm that solves (k , /)-Tree in general 
graphs in time 0*( 1.657 fe 2^/ 2 ). This can be seen as a generalization of the technique of 
Bjorklund et al. ei m from detecting paths to detecting trees. In the original technique 
one enumerates walks of length k (rather than paths, which are walks without repeating 
vertices) and then uses an algebraic tool to sieve-out the walks which are not paths. The 
algebraic tool is to design a polynomial which is a sum of monomials, each corresponding 
to a walk. The trick is that thanks to the use of bijective labellings, the non-path walks 
can be paired-up so that both corresponding monomials are the same and thus cancel over 
a held of characteristic 2. It is quite clear that in the tree case walks should be replaced by 
branching walks (see [35]4. However, the main difficulty lies in the pairing argument, which 
requires a new labelling scheme and becomes much more delicate. Indeed, this extension is 
non-trivial as it is exactly the topic of the open problem posed by Raible et al. [38] mentioned 
in the previous section. Our algorithm, similarly as in [3] uses a random bipartition of the 
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A 

Running Time 

3 

0*( 1.5705 fe ) 

4 

0*( 2 2k / 3 ) = 0*( 1.5874 fc ) 

5,6 

0*(2 7k / w ) = 0*( 1.6245 fe ) 

7,8 

0*( 2 5k / 7 ) = 0*( 1.6406 fc ) 

9,10 

0 *( 2 i3fc/i8) = o*(i.6497 fc ) 

11,12 

0*(2 8fc / n ) = 0*(1.6555 fc ) 


Xv(G) 

Running Time 

4 

0*(1.6199 fe ) 

5 

0*(1.6356 fe ) 

6 

0*(1.6448 fe ) 

7 

O*(1.6510 fe ) 

8 

0*(1.6554 fe ) 


Table 2: Running times of our Zc-Path algorithm in bounded degree graphs (left), and 
bounded vector chromatic number (right). Replacing k by n gives bounds for Hamiltonic- 
ity. 

vertices. The running time depends in a crucial way on a random variable (called the number 
of needed labels) in the resulting probability space. The second difficulty was to determine 
the distribution of this variable (Lemma 12 .101) . and again this turned out to be much more 
tricky than in the path case. 

The (k , Z)-Tree algorithm described above already implies an improved running time for 
fc-IST in general graphs. However, it works much faster if the hidden spannning tree has few 
leaves. We design a different strategy, based on finding a maximum matching, to accelerate 
the algorithm when it looks for solutions with many leaves. By merging the two strategies, 
we obtain an 0*(3.455 fc ) time algorithm for A’-IST. This is the first 0*((4 — H(l)) fe )-time 
algorithm that uses polynomial space. It immediately implies a moderately exponential time 
algorithm running in time 0*(1.946 n ), which breaks the 0*(2 n ) barrier. 

Paths and Trees in Colored Graphs. Next, we study the (k, Z)-Tree problem in graphs 
that admit a vertex coloring with d colors. This can be seen as an extension of Bjorklund’s 
0*( 2 n / 2 )-time algorithm for Hamiltonicity in bipartite graphs [2], However, the insight 
in [2] was to find a small vertex cover of the hidden solution. Here, we use a different 
insight: by choosing roughly half of the color classes we get a small subset of the hidden 
solution vertices which covers many (but not all) of its edges. The resulting algorithm runs in 

0*(2 y d(d-i) > ) time when l = 0(1) (see Section 14. II for a more complicated bound in the 
general case). For a graph of bounded degree A that is neither complete nor an odd cycle, it 
is possible to construct a proper A-coloring in linear time (e.g., by Lovasz’s proof of Brooks’ 
theorem [33]). This immediately results in a fast algorithm for (A, Z)-Tree in bounded degree 
graphs (see Table [2] for the special cases of A:- Path and Hamiltonicity; the case A = 3 
is solved by a special algorithm, see below), along with an improved algorithm for &-IST in 
such graphs (see Table [3] in Section 15.11) . 

Fractional coloring is a well studied generalization of the classical vertex coloring. Our 
algorithm for graphs of low chromatic number generalizes quite easily to the case of low frac¬ 
tional chromatic number. This has consequences in improved algorithms for some special 
graph classes, e.g. an 0*(1.571 fc 1.274 i )-time algorithm for (k, Z)-Tree in subcubic triangle- 
free graphs, or 0*(1.571 fc )-time algorithm for Zc-Path in general subcubic graphs. For sub- 
cubic graphs of even larger girth, we get further improved bounds. 

Another relaxation of the classical coloring is vector coloring, known for its importance 
in approximation algorithms (e.g. [23]). Its important advantage is that, contrary to the 
classical or fractional coloring, a (1 + e)-approximation can be found in polynomial time [21] . 
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We provide an algorithm for (fc, Z)-Tree that applies vector coloring. It results in improved 
running time when the vector chromatic number is at most 8 . 

1.3 Notation 

Throughout the paper we consider undirected graphs. For an integer k , by [A;] we denote the 
set {1,2,..., k}. For a set S and an integer k by ( fc ) we denote the family of all subsets of S 
of size k. Let us write V = V(G) = [n] for the vertex set and E = E(G) for the edge set of 
the host graph G. 

2 Finding Trees on k Vertices with l Leaves 

In this section we generalize the A;-Path algorithm by Bjorklund et al. El SI to finding subtrees 
with k nodes including l leaves. We focus on undirected graphs. Throughout this section we 
assume we are given a fixed partition V = V\ U V 2 of the vertices of the input graph. To 
get the promised generalization, in Section 12.61 we will use a random bipartition, similarly as 
in El SI- In the next sections we investigate other partitions. 

2.1 Branching Walks 

The notion of branching walk was introduced by Nederlof [35] . A mapping h : V (T) — > V (G) 
is a homomorphism from a graph T to the host G if for all {a, b} G E(T) it holds that 
{ h(a), h(b)} G E(G). We adopt the convention of calling the elements of V(T) nodes and the 
elements of V(G) vertices. 

A branching walk in G is a pair B = (T, h) where T is an unordered rooted tree and 
h : V[T) —>• V(G) is a homomorphism from T to G. 

Let B = (T, h ) be a branching walk in G.The walk starts from the vertex h( 1) in G. The 
size of the walk is \V(T)\. The walk is simple if h is injective. The walk is weakly simple 
if for any node x G V(T), the homomorphism h is injective on children of x. The walk is 
U-turn-free if for any node a of T, every child b of a maps to a different vertex than the parent 
c of a, i.e., h(b) 7 ^ h(c). 

A proper order of B is any permutation n : V(T) —> {1,..., |V r (T)|} such that for every 
two nodes a,b G V(T), 

(i) if depth(a) < depth(6) then 7r(a) < 7 r(6), 

(ii) if a,b ^ root(T) and 7r(parent(a)) < 7r(parent(6)) then ir(a) < 7 r(6), 

(in) if a and b are siblings and h(a ) < h(b) then 7r(a) < 7 r(6). 

The following proposition is immediate. 

Proposition 2.1. Every weakly simple branching walk has exactly one proper order. 

We say that a weakly simple branching walk B is properly ordered if V ( T ) = {1,..., |P(T)|} 
and the proper order from Proposition 12.11 is the identity function. 
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2.2 Labeling 

For a tree T, by L(T) we note the set of leaves of T and by I(T) we denote the set of internal 
vertices of T, i.e., I(T) = V(T) \ L(T). 

Like in [4] or [5] our crucial tool are labelled branching walks. In [5j, every node in the 
tree T of a branching walk ( T,h ) was assigned a label. Here, similarly as in |4j, we do not 
assign labels to some nodes, but we assign labels to some edges of T. We define the set of 
labellable elements of a branching walk B = (T, h) as 


la(.B) = L(T) U (h(I(T)) n Hi) U {uv £ E(T) : h[u ), h(v) £ H 2 }. 
Similarly, for a subtree T of graph G, define 


la(T) = L(T) U (J(T) n Hi) U {uv £ E(T) : u, v £ H 2 }. 

We say that a branching walk B = (T, h) is admissible when B is weakly simple, U-turn- 
free, and properly ordered. For nonnegative integers k, l, r, we also say that B is (k, l, r)-fixed 
if T has k nodes and l leaves, and | la(L>)| = r. 

2.3 The Polynomial 

Let r be an integer. We use three kinds of variables in our polynomial. First, for any edge 
uv £ E(G), where u < v, we have a variable x uv . For simplicity we will denote x vu = x uv . 
Second, for each q £ H U E and for each l £ [r] we have a variable y q j. Third, for each 
v £ H (G), we have a variable z v . By x we denote the sequence of all x m ,-type variables, while 
by y we denote the sequence of all y^j-type variables. 

For a branching walk B = ( [T,h ) and a labeling £ : la (B) —>■ [|la(H)|] we define the 
monomial 


mon(I?, £) ^h(l) 11 n Vh{q)/{q) i 

{u,v}&E(T ) gela(B) 

u<v 

where for uv £ E(T), h(uv ) refers to edge h(u)h(v ) £ E(G). Next, we define a multivariate 
polynomial Pj with coefficients in a field of characteristic two by setting 


£ £ mon(H, £). 

B=(T,h) «a(B)—!-[|la(B)|] 

B is admissible i bijective 
B is ( k , /, i)-fixed 

Finally, let P ri = Y!i =2 p i- 

Lemma 2.2. The set of pairs of the form (B,£), where B is a non-simple, admissible and 
( k, l, i)-fixed branching walk and £ : la (B) —> [|la(H)|] is a bijection, can be partitioned into 
pairs, and the two monomials corresponding to each pair are identical. 
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Proof. Fix a pair ( B ,£) from the set in the claim. 

Case 1. There exist two elements ei,e 2 £ la(-B) such that h(e±) = h(e 2 ). We take the pair 
with lexicographically minimal value of (min{£(ei), l(e 2 )}, max{£(ei), £(e 2 )}); note that there 
is a unique minimum since l is injective. 

We define £' : la (B) —> [| la(B)|] as follows: 


'*(«) 

if x = v, 

< £{v) 

if x = u, 

A x ) 

otherwise. 


Note that £' 7 ^ £ because i is injective. In this way, to the labelled branching walk ( B , l) we 
assigned another labelled branching walk (B,£'). Note that mon(L>,£) = mon(S,£ / ). Observe 
that (B,£') is also admissible and fixed, and B is non-simple and falls into Case 1. 

Moreover, if we begin from ( B,£') and follow the same way of assigning labels, we get ( B ,£). 
Thus we have paired up all the non-simple labelled branching walks that fall into Case 1. In 
what follows we assume that Case 1 does not apply. 

Observe that once Case 1 is excluded, no two edges of T map to the same edge of G. 
Indeed, assume that there is a pair of edges u\V\ and U 2 V 2 of T that map to the same edge 
uv £ E(G), with h{u\) = h(u 2 ) = u and h{v\) = h(v 2 ) = v. Then, either one of u or v is in V\ 
or both are in V 2 . By the definition of la(i?), in the prior case u\, U 2 £ la(i?), or v\, V 2 £ la(L>) 
and in the latter case u\Vi,U 2 V 2 £ la (B). In both situations Case 1 applies. 

Define 

A = {(u,v) : u,v £ V(T) and h(u ) = h(v)}. 

Since Case 1 does not apply, and B is not simple, we know that for every pair (u, v) £ A 
at most one of u, v is in la(-B). Let 

A* = {(u,v) £ A : h(u ) = min h(x)}. 

(x,y)eA 

Consider any node x £ V(T). Then, we define 


&{x) 


£{x) 

< min {£{z) : z £ children(x) C la(i?)} 
mm.{£{xz) : z £ children(x)} 


if x £ la(.B), 

if children(x) fl la (B) / 0, 
otherwise. 


Case 2. There is a pair (u, v) £ A* such that u is an ancestor of v. Let us define a subset 
A** C A* consisting of pairs (it, v) £ A* such that node u is an ancestor of v, and for every 
node x on the path from v to the root, if h(x) = h(u) then x £ {it,u}. We choose the 
pair (u,v) £ A** such that the pair (u,£^(v)) is lexicographically minimal. Note that there is 
exactly one such pair. Indeed, consider another pair (■uv') £ A** with (it', fG{v')) = (■u , £^{v)). 
Clearly, u, = v!. If v 7 ^ v' then, since £^(v) = £^{v'), it means that v is a child of v' or vice 
versa. However, h(v ) = h(v'), so h is not a homomorphism, a contradiction. 

Let T' be the tree obtained from T by reversing the path from u to v. More precisely, 
consider the path p uv = uu\ ■ ■ ■ UfV. Remove the edges uu\ and UfV from T, obtaining three 
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trees: T u (rooted at root(T)), T p (rooted at u\), and T v (rooted at v ). Next, root T p at vt, 
obtaining T p . Finally, make the root of T p the child of u in T u , and make v the child of u\ in 
T p . The resulting tree is denoted by T'. 

Note that the multigraphs h(T') and h(T) are equal. Now we essentially want to leave h 
unchanged; however the branching walk B' = (T r , h ) is not properly ordered and we need to 
renumber the nodes. 

Note that B' is weakly simple since no two edges of T (and hence also of T r ) map to the 
same edge of G. Hence, by Proposition 12.11 there is exactly one proper order n of B ’. Let T" 
be obtained from T' by replacing every node x by n(x). Let us define homomorphism h" by 
h"{x) = h(n~ 1 (x)) for every x G V(T"), and let B" = ( T",h"). Note that T" has the same 
number of leaves and internal vertices as T and B" is properly ordered. Also, T" is weakly 
simple and U-turn-free, for otherwise T" has two edges that map by h" to the same edge e 
of G, which implies that T has two edges that map by h to e, a contradiction. Hence B is 
admissible. Note that there is a one-to-one correspondence between elements of la(-B) and 
la {B"). It follows that B" is fixed. 

Now let us define a labeling £" of B". For every node x € la(L>"), we put £"{x) = £(i r _1 (x)). 
Note that h(T ) = h(T") as multigraphs, and hence since in T no two edges are mapped by h 
to the same edge of G, the same holds for T". It follows that for every edge xy of T" there 
is a unique edge xo 2 /o of T such that h(xy) = h(xoyo). For every edge xy £ E(T") n la (B") 
we pick the corresponding edge xoyo of T, and we put £"{xy) = £(xoyo). The definitions of 
l" and h" imply that mon (B,£) = mon (B",£"). 

We claim that B" ^ B. Note that v 7 ^ u\, for otherwise h is not a homomorphism. Hence, 
the path p uv = uu\ ■ ■ ■ utv has at least one internal vertex, i.e., t > 1. Consider the walks 
W = h{u ), h(ui ),..., h(ut), h(v) and W" = h(v), h(ut), ■ ■ ■ , h(u\), h(u) in G. If B" = B then, 
in particular, W = W". But this implies that T is not U-turn-free, a contradiction. 

Hence to (B,£) we have assigned a different pair (. B",£"), which also satisfies Case 2 but 
does not satisfy Case 1. Let (u',v') be the pair of nodes chosen in Case 2 for (. B",£"). Note 
that in ( B",£") we have h"{u) = h"{v) = h"(u') = h"(v'). Since only the subtree of u is 
changed, for every node x < u we have n(x) = x. It follows that v! = u. Note that £"^ 
can differ from (G only for the vertices u \,... ,ut on the path p uv . However, none of them 
maps to h(u), because otherwise (u,v) ^ A**. Hence, v' = v. Thus we have paired up all the 
non-simple, admissible and (k, l, i)-fixed labelled branching walks that fall into Case 2 , but 
do not fall into Case 1. 

Case 3. There is a pair (it, v) € A* such that neither u is an ancestor of v nor v is an ancestor 
of it. Assume w.l.o.g. that h (parent(it)) < h (parent (v)). If there are many such pairs (u,v), 
we take the pair with lexicographically minimal value of ( h (parent(it)), h (parent(u))). We 
claim there is exactly one such pair. Indeed, consider another pair (it',i/). Then h{u) = 
h{u') = h{y) = h(y') since both pairs are in A*. Also h(parent(u)) = h(parent(it')) and 
h (parent (v)) = h (parent(i/)) by the minimality. Then u = v! for otherwise two edges of T 
map to the same edge h (u) h (parent( u )) of G. Similarly, v = v' for otherwise two edges of T 
map to the same edge /i(u)/i(parent(i;)) of G. 

Let T u and T v be the subtrees of T rooted at u and v, respectively. Let T' be the tree 
obtained from T by swapping T u and T v . Note that h(T') = h(T ) as multigraphs. Now we 
define a branching walk B" = (T". h") by renumbering vertices of T' using a proper order 7r, 
exactly as in Case 2. By the same argument as in Case 2, B" is admissible and (k,l,i)- fixed. 
We also define a labeling £" of B" exactly in the same way as in Case 2. It follows that 
rnon(HT) = mon (B" ,£"). 
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We claim that B" ^ B. Otherwise, there is an isomorphism i : V(T U ) — > V(T v ) such that 
for every node x £ V(T X ) we have h(x ) = h(i(x)). Pick any leaf 2 £ V{T U ). Then i(z ) is also 
a leaf. Hence both z and i[z ) are in la {B). It follows that Case 1 applies to z and i(z), a 
contradiction. 

It is clear that (B",£") also satisfies Case 3 and it does not satisfy Case 1. Assume it 
satisfies Case 2, i.e., there are nodes (u*,v*) £ A* such that u* is an ancestor of v*. Note 
that h(u*) = h(v*) = h(u ) = h(v). We can assume that u* = u or u* = v, for otherwise 
the pair (u,v) satisfies Case 2 for the labelled branching walk (B,£), a contradiction. By 
symmetry assume u* = u. Then {v,v*} satisfies Case 2 for the labelled branching walk 
( B , £), a contradiction again. Hence to ( B , £) we have assigned a different pair ( B", £"), which 
also satisfies Case 3 but does not satisfy the earlier cases. 

We claim that if we begin from (B", l") and follow the same way of assignment we get 
(B,£) back. Indeed, note that only the subtrees of u and v are changed, but since Case 2 is 
excluded, these subtrees contain no vertices that map to h(u), except for u and v. It follows 
that in ( B" ,£') the pair (tt(u),7t(v)) is chosen. After swapping the subtrees (and renumbering 
the vertices, redefining the labelling, etc.) we get (B,£) back. 

Thus we have paired up all the non-simple, admissible and (k, l, i)-£xed labelled branching 
walks that fall into Case 3, but do not fall into the earlier cases. This finishes the proof. □ 

Lemma 2.3. The polynomial Pp. is non-zero iff the input graph contains a subtree Tq with 
k nodes and l leaves, such that la(TG-) < r. 

Proof. First assume Pp. is non-zero. It follows that Pp. contains at least one monomial with 
non-zero coefficient. By Lemma [2.21 every such monomial corresponds to a simple branching 
walk B = (T,h), because all the monomials corresponding to non-simple branching walks 
cancel-out over a field of characteristic two. Hence h(T) is the desired tree. 

Let Tq be a subtree of G with k nodes and l leaves such that la(T) < r. Pick an arbitrary 
vertex v of Tq and let T v be tree Tq rooted at v. By Proposition l2.il there is a proper order 
7T of the simple branching walk (T v , idy^). Let T denote the tree T v after replacing every 
vertex x by 7 t(.t). Define a homomorphism h : V(T) —> V(G) by putting h(x) = ir~ 1 (x). Then 
B = ( T , h) is admissible. Note that there is a one-to-one correspondence between la (Tq) and 
la(P). It follows that B is (A;, l, \ la(TG)|)-fixed. 

Finally, choose an arbitrary bijection £ : la(P) —> [la(P)]. We must now have P ^ 0 be¬ 
cause we can uniquely reconstruct the pair (P, £) from its monomial representation mon(P, £): 
indeed, first recover the root vertex v from the unique z„-type indeterminate; second, recover 
B = (T, h ) from the x a! b-type indeterminates using the fact that B is simple, properly or¬ 
dered, and starts from u; then recover £ from the y x j -type indeterminates using the fact that 
£ is injective. □ 

2.4 Evaluating the Polynomial P^ r in Time 0*(2 r ) 

In this section we show that the polynomial Pp. can be evaluated in a given point (x, y, z) 
in time 0*{ 2 r ). Clearly, it suffices to show this bound for every polynomial Pj. To this end, 
let us rewrite Pi as a sum of 2 r polynomials such that each of them can be evaluated in time 
polynomial in the input size. For each X C [r], let 
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p t x = mon (b,£). 

B=(T,h) £:\&(B)->X 

B is admissible 
B is (k, /, i)-fixed 

Note that we do not assume that the labelings in the second summation are bijective. 

Lemma 2.4. Pi = P x . 

XQ{l,2,...,k} 

Proof. Let us fix an admissible (k, l, i)-fixed branching walk B = (T, h ). Note that a function 
i : la(-B) —> [| la(-B)|] is bijective if and only if it is surjective, so 

mon (B,£) = ^ mon (B,£). (1) 

£:la(S)—>[| la(B)|] «a(B)-s-[| Ia(B)|] 

i bijective t surjective 

Recalling the coefficients of P.[ are from a field of characteristic 2, and hence —1 = 1, we have, 
by the Principle of Inclusion and Exclusion, 

mon (B,£) = E E mon(B,£). (2) 

f:la(S)->-[|la(B)|] XC[r] «a(B)-s-JY 

t surjective 

From m and © we immediately obtain 

Pi= 5Z mon (B,£). (3) 

B=(T,h) XC[r] L\&(B)^X 
B is admissible 
B is (k, l, z)-fixed 

The claim follows by changing the order of summation. □ 

Now we are left with a tedious job of evaluating P x (x, y, z) in polynomial time. To 
simplify notation in the running time bounds, let us write m for the number of edges in G, 
and fi = 0((k+r) log(A:+r) log log(fc + r)) for the time needed to multiply or add two elements 
of F 2 rio g ((fc+7-))i+o(i). 

Lemma 2.5. Given a nonempty X C [r] and three vectors x, y, z of values in F 2 |-i og (fc)i+o(i) as 
input, all the values of P A (x, y, z) for i = 0,..., r can be computed by dynamic programming 
in time 0(mfe 4 / 2 /i) and space 0{mk 2 l). 

Proof. W.l.o.g. we assume that V(G) = {1,2,...,rz}. For a vertex a £ V(G), denote the 
ordered sequence of neighbors of a in G by a\ < a 2 < ■ ■ ■ < fldeg G (a)- We say that a U-turn- 
free branching walk B = ( T,h ) starting in a vertex a £ V(G) is j-forced when for any child 
u of the root 1 in T it holds that h{u) > aj. 

Our objective is to compute a five-dimensional array A x whose entries are defined by 

A [a,j, k ,l,i] £ ' 'y ' %h(u),h(v) Vh{q),l{q) i 

B=(T,h) Ha(B)\{l}—s-X {u,v}eE(T) q€la(B)\{l} 

u<v 
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where the outer sum is over all branching walks B which are admissible, (A/, l', *)-fixed, j-forced 
and start from a. The entries of A a admit the following recurrence. For j = deg G (a) + 1, or 
k! = 1, or V = 0, or i < 0 we have 


A x [a,j,k',l',i] 


1 if k! = 1, V = 0 and i = [a € Vj], 
0 otherwise. 


Define 


f{u, v, b) 


x u,v ^2ti,t 2 ex yv,tiUuv,t 2 

if u, v £ V 2 and 6=1, 

x u,v ^2t£X Vuv,t 

else if u, v G V 2 , 

x u,v Vv,t 

else if v € Lj or 6 = 1 

%U,V 

otherwise. 


For 1 < j < deg G (a), 2 < k 1 < k, 1 < V < l, 0 < i < r we have 


A x [a,j,k',l',i\ = A x [a, j + 1 ,k',l',i\ + 

f(a,aj, 1) • A x [a,j + 1, k' - 1, l 1 - 1 ,i - 1 - [a,a,j € V 2 ]\+ 
f(a,a-j,0)^2 AX l a ’i + ^,hji,h] • A x [ a j ,l,k 2 ,l2,i2\- 

k\+k2=k' li+l 2 =l' ii+i 2 =i— [a t aj€V2] 
fei,fe 2 >l ii>0 ii>0 

(4) 

To see that the recurrence is correct, observe that the three lines above correspond to branch¬ 
ing walks which are admissible, (k ! . I', i)-fixed, j-forced and start from a, where either (a) the 
root has no child which maps to aj, or (b) the root has exactly one child which maps to aj 
and the child is a leaf, or (c) the root has exactly one child which maps to aj and the child 
is an internal node. (At most one such child may exist because the branching walk is weakly 
simple.) 

To recover the value of the polynomial P A (x, y, z), we observe that 

P t X (x, y, z) = Y z v Y y v>t ■ A x [v,l,k,l,i] + Y Zv-A x [v,l,k,l,i]. (5) 

reVi tex vev 2 

The time bound follows by noting that the values of f(u,v) can be precomputed and 
tabulated in 0(k[n + m)n) time to accelerate the computations for the individual entries of 
A x . □ 


From Lemmas 12.41 and 12.51 we immediately obtain the following. 

Lemma 2.6. The polynomial P± r can be evaluated in time 0*(2 r ) and polynomial space. 


2.5 A Single Evaluation Algorithm 

Assume we choose the value of parameter r large enough so that if there is a ( k , i)-tree Tq 
in G , then |la(T G )| < r. Then, by Lemma 12.31 we can test the existence of a (k,l)- tree by 
testing whether the polynomial P^ r is non-zero. The latter task can be performed efficiently 
using a single evaluation of the polynomial P\ r . To show that this is the case we need the 
Schwartz-Zippel Lemma, shown independently by DeMillo and Lipton m , Schwartz [411 and 
Zippel [ 46] , 
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Lemma 2.7. Let p(x\,X 2 , ■ ■ ■, x n ) G F[x \,..., x n ] be a polynomial of degree at most d over 
a field F, and assume p is not identically zero. Let S be a finite subset of F. Sample values 
a±, 02 , ■ ■ ■ , a n from S uniformly at random. Then, 

Pr(p(ai,a 2 ,... ,a n ) = 0) < d/\S\. 

Lemma 2.8. Let V(G) = V\ U V 2 be a fixed bipartition of the vertex set of the host graph G. 
There is an algorithm running in 0*(2 r ) time and polynomial space such that 

• If G does not contain a ( k,l)-tree then the algorithm always answers NO, 

• If G contains a ( k,l)-tree Tq such that |la(T(j)| < r, then the algorithm answers YES 
with probability at least \. 

Proof. The algorithm is as follows: using the algorithm from Section [23 we evaluate the poly¬ 
nomial Pj_ r over the field GF(2^ og2 ^ fc+r ^ +1 ), substituting the indeterminates by independently 
chosen random field elements. The time bound follows from Lemma 12.61 

If there is no ( k, l )-tree in the input graph, by Lemma 12.31 the evaluation returns 0, so we 
report the correct answer. 

Now assume there is a (k, l )-tree Tq such that la (Tq) < r. Then, by Lemma 12.31 P is 
a non-zero polynomial. Note that deg(-Pj) = k + i, hence deg(P^. r ) < k + r. Hence, by the 
Schwartz-Zippel Lemma P evaluates to the zero field element with probability at most 
This finishes the proof. □ 

2.6 The random bipartition algorithm 

In this section we assume that V = V\ U V 2 is a random bipartition, i.e., every vertex goes to 
V\ independently with probability 1/2. Our plan is to choose the value of parameter r large 
enough so that if there is a (k, l )-tree Tq in G , then with high probability j la(Tc)| < r. Then, 
by Lemma [2781 we are done. Of course, putting r = k would perfectly achieve the above goal, 
but then we only get the running time of 0*(2 k ), matching that of Zehavi fU]. 

A natural choice is to set the value of r close to the expectation of | la(T/;) |. The following 
lemma follows from the definition of la (Tq), by the linearity of expectation. 

Lemma 2.9. For every ( k,l)-tree Tq in G, we have E(| la(Tc)|) = |k + \l ~\- 

By the lemma above and Markov’s inequality, if we put r = + \l, then the probability 

that |la(T(j)| < r is fl(—rj). Hence it suffices to repeat the algorithm from Lemma 12.81 (i.e.. 
evaluate the polynomial P^ r ) 0(k + l) times answering true iff at least one evaluation was non¬ 
zero, to get a Monte-Carlo algorithm for testing the existence of a (k, l)- tree. The complexity 
of this algorithm is 0*( 2 ( 3fc + 2 b/ 4 ), However, similarly as in [am, we can do better. The idea 
is to use a value of r smaller than that appearing in the expectation by an Ll(k) term. Then 
the probability that a ( k , ^-subtree is admissible is inverse-exponential. Hence, we need to 
repeat the algorithm from Lemma 12.81 exponentially many times, every time for a different 
random bipartition. However, it turns out that for carefully selected values of r, this pays 
off. To find this value, the following lemma is crucial. 

Lemma 2.10. Fix an arbitrary ( k,l)-t.ree Tq in G. For any integer t such that 0 < t < 
(.k — 1) /2, we have 

Pr(| la (7 g) | <k + ±-t)>^(^ 2t i y 
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Proof. Root Tq at an arbitrary vertex r. Let the random variable X 22 denote the number of 
edges uv £ E(Tg ) such that u, v £ V 2 . Also, let X\^ denote the number of internal vertices 
in V\. Then, by the definition of la (Tq), we have | la(G'r)! = l + X\ {1 + A 22 . 

Fix a subset of edges S £ For a = 0,1, let c a : V(Tq) —> {1, 2} be the assignment 

of vertices of Tq to sets V\ , V 2 such that for every v £ V(Tq), we have c a (v) = 1 if and only if 
on the path from r to v in Tq the number of edges from S is congruent to a modulo 2. Since 
every vertex is colored 1 in exactly one of the colorings Co, ci, we infer that 

X hi (c 0 ) + A m (ci) = k — l. 

Similarly, every edge in E(Tq ) \ S is colored 22 in exactly one of the colorings co, ci; hence 
X22(cq) + X 22 (ci) = k — 1 - 2t. 


It follows that 


min-pfi^co) + A 22 (c 0 ), Y m (ci) + Y 22 (ci)} < (k - l + k - 1- 2t)/2 

< k — | — t. 


Hence, for at least one of the colorings cq, c\, we have | la(T G )| < k + | — t. For all choices of 
S there are at least \ ( T ) such colorings, so the claim follows. □ 

The following lemma follows immediately from Stirling’s approximation. 


Lemma 2.11. For any fixed a, 0 < a < 1, 



= O* 



1 

- a ) 1 - 01 


n 


Theorem 2.12. There is an 0*(1.66 k 2 l / 2 )-time Monte-Carlo polynomial space algorithm 
for finding a ( k,l)-tree which never reports a false positive, and reports false negatives with 
constant probability. 


Proof. Fix an e > 0 and let t = |_(| + e)fcj. Put r = k — t+ = [(| — e)k~\ + [|]. We choose 
a random bipartition V = V± U V 2 and apply the algorithm from Lemma 12.81 We repeat this 
times, and we return YES iff at least one of the executions of the algorithm 
from Lemma 12.81 returned YES. If there is no (k,l )-tree in the input graph, by Lemma 12.81 
we report the correct answer. Now assume there is a (k,l )-tree Tq. Call a bipartition nice 
if |la(T G )| < r. By Lemma 12.101 a random bipartition is nice with probability at least 
p = 7^+1 ( fc ^ 1 ) • Hence, at least one of the tried bipartitions is nice with probability at least 
1 — (1 — p) 1 ^ > 1 — 1/e. For such a bipartition, the algorithm from Lemma [2.81 answers YES 
with probability at least /. Hence our algorithm reports a false-negative with probability at 
most 1/e + \ < 1- 

By Lemma 12.81 the running time is 


O* 


w+k , 


k- 1 
2 1 


= O* 


2 (7/4-6)fe+Z/2 1 f 


V(l/2 + 2 e)k 


By Lemma [2.111 we can express this by 0*((f(e)) k 2^ 2 ), for /(e) = 2 7 / 4 e (i + 2 e) 2 +2e (i — 
2 e) 2 ^ 2e . The function / attains a minimum smaller than 1.65685 for e = 0.042894. Hence 
the claim. □ 
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3 Spanning Trees with Many Internal Vertices 


In this section we give an improved algorithm for fc-IST. As previous algorithms (see, e.g., 
0 ) for this problem, we depend on the following insight. 

Lemma 3.1. A graph G contains a spanning tree with at least k internal vertices iff G 
contains a subtree with exactly k internal vertices and at most k leaves. 

We call such a tree a witness for /c-IST. A witness tree is minimal if one cannot remove 
any leaf from it without reducing the number of internal nodes. It is easy to see that such a 
tree can be extended to a fc-IST simply by connecting each remaining vertex in the graph to 
the tree arbitrarily. 

We will use two strategies to search for a minimal witness tree. In the first, which we 
refer to as strategy A, we search for a (k + l, l )-tree for every l = 2 ,..., ak, for some a to 
be specified later, with our algorithm from Theorem 12.121 Its runtime gets worse the more 
nodes, and especially leaves, the tree has. Running only strategy A to solve for a fc-IST (i.e. 
let a = 1) results in an 0*(3.883 k ) time algorithm using polynomial space, already improving 
over state of the art. 

In our second strategy, strategy B, we search indirectly for a (k + 1,1 ')-tree for every 
l = ak,... ,k, where l 1 < l, by using an observation from Zehavi [35], that a tree with many 
leaves can be trimmed to a small tree with few leaves and a disjoint matching. Furthermore, 
such a disconnected subgraph can easily be combined into a (k + 1, l ')-tree for some V < l by 
connecting the edges of the matching one-by-one to the tree in an arbitrary order. 

For completeness, we state and prove the following lemma implicit in [45] : 

Lemma 3.2. Every minimal (k + 1, l)-tree with l > (/c/2) + 1 has a subgraph consisting of a 
minimal (k + l — 2, l — 1 )-tree and a disjoint edge. 

Proof. Let us root the tree at an arbitrary leaf. Every leaf’s parent has only one child, 
otherwise it will not be minimal since we can remove the leaf. Associate with a leaf £ the 
path p(£) of all internal vertices which have only the leaf £ as their descendant among all 
leaves. Note that every internal node is associated with at most one leaf, and that p{£) 
contains at least the parent (since the tree is minimal). If there is no leaf £ with exactly one 
internal node in p(£), we would have at least twice as many internal nodes as leaves violating 
the assumption l > [k/ 2) + 1. Hence there is a leaf £ with \p{£)\ = 1, and we can remove 
£,p(£) as the disjoint edge from the tree, leaving a minimal (k + l — 2 , l — l)-tree. □ 

By iteratively applying the lemma above, we have that 

Corollary 3.3. If a k-IST instance graph G has a ((1 +/3)k, fk)-tree witness for some /3 > 0.5, 
then it also has a minimal (3(1 — /3)k, (1 — /3)k)-tree and a disjoint (2/3 — 1 )k edge matching. 

In contrast to [45], that uses representative sets to separate the matching from the tree for 
fc-IOB (the directed version of the fc-IST problem), we will simply use a random bipartition. 
It turns out to be a faster algorithm because in our algorithm for (k + l, /)-tree from Theo¬ 
rem [ 2 T 2 ] the leaves cost much more than internal nodes, unlike in the directed counterpart 
from [53J where all nodes cost the same. 

For each l = ak,..., k, we let 7 = l/k and repeat the following r(y) times. We partition the 
vertices in Vm U Xf = V. With probability p(y) we put a vertex in Vm, otherwise we put it in 
Vr- First we compute in polynomial time a maximum matching in the induced graph G[Vm]- 
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If we find one with at least (27 — 1 )k edges, then we also search for a (3(1 — 7 )k, (1 — 7 )fc)-tree 
in the induced graph G[Vt] with the algorithm from Theorem 12.121 If we also find a tree, we 
combine the found matching and the tree into a (k + l, l ')-tree witness and output yes. If all 
runs for all l fail we output no. We set the parameters r( 7 ) and £>( 7 ) as 


r( 7 ) 


1 + 7 V 47 2)k ( 1 + 7 
47-2/ \ 3(1 — 7 ) 


3(1—7)fc 

, P{ 7) 


47-2 
1 +7 ' 


The probability that none of the r( 7 ) tries succeeds in separating the tree and the matching 
is at most 


1 - 


47-2 

1 + 7 


(47—2)fc 


3(1 ~ 7) \ 
1 + 7 / 


3(1—7)fc\ r O 


(l-r( 7 )- 1 ) r W < e” 1 . 


We get that with probability at least 1 — e -1 we successfully separate the matching and 
the tree in the bipartition, and hence strategy B works. The running time for a fixed ratio 
7 is 0*(r(7)1.657+ 1 ~+ fc 2( 1- + fc / 2 ). One can check that this is a decreasing function for 7 £ 
[0.68,1]. Hence, when a € [0.68,1], the total time of strategy B is 0*(r(a)1.657 3 +^"^ fc 2^ 1_Q ^ fc//2 ) 
On the other hand, strategy A takes time 0*(1.657( 1+< + fc 2" fc//2 ), which increases with a. The 
best trade-off between strategy A and B is obtained for a = 0.8627..., which gives a total 
running time of 0*(3.455 k ). 

For a = 0.8627, in both strategies, the worst running time bound is obtained when l = ak 
and k + l = n; indeed, if k + l > n, it is clear that there is no solution. Therefore, the worst 
running time bound is obtained when k = n/( 1 + a), which makes our algorithm run in time 
0 *( 3 . 455 n// T+ Q )) = 0*( 1.946™). Thus, we have proved the following result. 

Theorem 3.4. There is an 0*(min{3.455 fc , 1.946 n })-time Monte-Carlo polynomial space al¬ 
gorithm for k- 1ST in general graphs. 


4 Colored Graphs 

In this section we improve the running times of algorithms from Section [2] in restricted settings. 
This is done by adjusting the partition V = V\ U Vi to particular graph classes where vertex 
colorings guide us in making the partition. We will consider three ways of coloring the vertices. 
The first is ordinary proper vertex coloring of the graph, i.e. color the vertices so that no 
edge has both its endpoints colored by the same color. The least number of colors needed 
is denoted by \G- The second way is fractional vertex coloring, that assigns a subset of b 
colors to each vertex from a palette of a colors so that the endpoints of each edge receive 
disjoint subsets of colors. The smallest possible ratio a/b is denoted by Xf{G). The third 
way is vector coloring that assigns unit length vectors to the vertices so the minimum angle 
a between every edge’s endpoints’ vectors is as large as possible. The smallest possible value 
of 1 + cos -1 (a) is denoted by Xv{G). 

The following chain of inequalities holds (see e.g. ESI), where u>(G) is the clique number, 
i.e., the size of the largest clique in G. 

Theorem 4.1. For any graph G, 

w(G) < X v(G) < Xf (G) < X (G). 
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4.1 The Ordinary Vertex Coloring 

Consider a proper d-coloring c : V —> {1,... , d} of the host graph G. Fix a number t G 
{0,... , d}. Our idea is to define V\ as the union of t color classes, and V 2 as the remaining 
vertices. Clearly, for some choices of the colors the set la(T’G') for a solution Tq can be large, 
and then by Lemma 12.31 we need to set the parameter r high, which makes the running time 
large. However, if we try all the possible choices of t colors, in at least one of them the set 
la (Tq) will be small enough. This is stated in the following lemma. 


Lemma 4.2. Let c be a given d-coloring of G. Let Tq be a fixed ( k,l)-tree in G. There is a 
choice oft color classes ci,... , q such that if we put V\ = (J* =1 c _1 (i), then 


HTq) < 


f _ x(d-x) \ 

V d(d-i)J 


k + (l 



l, 


where x 


1 d+(l/k)(d- 1 ) 
L 2 


In particular, \ 1 &(Tq)\ < 


lIj rli \ 

d(d- 1 ) J 



Proof. For i = 1 ,..., d, let ki denote the number of nodes of Tq colored by i. Similarly, let /j 
denote the number of leaves of Tq colored by i. Finally, for i, j = 1 ,..., d, let h h j denote the 
number of edges in Tq with one endpoint colored with i and the other colored with j. Note 
that 


J2h = k 


i= 1 
d 


Tu = l 

2—1 

l<i<j<d 


( 6 ) 

(7) 

( 8 ) 


Fix t = 0,... ,d. It follows that the average size of | la(Tc)|, over all possible choices of 
the set S of t colors, equals 


< 


Jd\ £ + ki d 

\t) Se( [ f) \ i€S S {i,j}ns=<t) 

(?) 

For t = x and t = \d/ 2], we get the claimed bounds J1 


□ 


2 We chose t = x in order to minimize the expression ^1 — ^ k + The choise t = [d/2] gives a 

simpler bound, but, if l is large, it is greater than the one with x. 
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By combining Lemma 12.81 with Lemma 14.21 we get the following theorem. 


Theorem 4.3. Assume we are given a proper d-coloring of the host graph, for some fixed 
d. Then, there is a Monte-Carlo polynomial space algorithm for finding a ( k,l)-tree running 

in time O* ^2^ ' J ( d - 1 )) fc+ ( 1 ~z) l \, where x = | d +(i/k)(d i) -j. j n p ar ti cu i ar , ih e running time 

_ d(d- 1 ) 2 


can be bounded by O* 


The algorithm never reports a false positive, and 


reports false negatives with constant probability. 


4.2 Fractional Vertex Coloring 

There is a considerable gap between the running time of the (k, l)- tree algorithm for 2- 
colorable graphs and 3-colorable graphs: 0*(1.415 fc+? ) vs 0*(1.588 fc 1.415 ; ). It is natural 
to ask whether for graphs with chromatic number 3, which are in some sense almost 2- 
colorable , one can do better than just run the algorithm for 3-colorable graphs. To address 
this and related questions in this section, we consider a relaxed concept of graph coloring, 
called the fractional chromatic number. We say that a graph G has an (a : 6)-coloring if, 
to each vertex of G , one can assign a 6-element subset of {1,2,3,... ,a} in such a way that 
adjacent vertices are assigned disjoint subsets. The fractional chromatic number is defined 
as Xf(G) = inf{! | G can be (a : 6)-colored}. To avoid confusion let us clarify that for 
| = ^, the fact that G is (a : 6)-colorable does not imply that it is (c : d)-colorable (for an 
example consider Kneser graph A ^ : 2 which is (6 : 2)-colorable but not 3-colorable, see e.g. the 
monograph of Scheinerman and Ullman [40]). In particular, not every (a : 6)-colorable graph 
is -colorable. Indeed, the gap between the fractional chromatic number and the chromatic 
number can be arbitrarily large. 

Now we show an analog of Lemma 14.21 

Lemma 4.4. Let c : V(G ) —> (^) be a given (a : b)-coloring ofG. LetTc be a fixed ( k,l)-tree 
in G. There is a set of t colors S £ (^) such that if we put V\ = {v £ V(G) j c(u) D S = 0} 
and V 2 = V(G) \ Vi, then 


la(lc)| £ (l- Gl-JiTl'j 1 , + ^ 


1_ML 

(?) j 


In particular, there is a color r £ [a] such that if we put V\ = {w £ V(G) \ r ^ c(u)} and 
V 2 = V{G)\V 1 , then 


|la(T G )| < ( + 

a J a 


Proof. For a given set of t colors S, let Vf = {v £ V{G) \ c(v) D S = 0} and Vf = V \ Vf. 
For two indices i, j £ {1, 2} define the set 

Efj = {uv £ E(Tg ) | u is the parent of v, u £ Vf and v £ Vf}. 
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Also, let if be the set of internal vertices of Tq in Vi. Similarly, let Lf be the set of leaves of 
Tq in Vi. Then, by the definition of la(Tc), 

I la(7b)| = l + \Ii I + \E%2 1 = 

l + |/f | + \V 2 | — |^i 2 1 ~ [root(Tcr) E y 2 s ] < 
l + k-\Lf\ - \E? 2 \. 

For I E (^), let kj denote the number of nodes of Tq colored by set I. Similarly, let Zj denote 
the number of leaves of Tq colored by I. Finally, for /, J E (^), let kj.j denote the number 
of edges in Tq with the parent colored with I and the child colored with J. Note that 

J2 kl = k ( 9 ) 

/e(N) 

£ h = l (10) 

/ e (W) 

^2 k i,J = k — 1. ( 11 ) 

/,Je( [ “ ] ) 

It follows that the average size of | la(Tc)|, over all possible choices of the set S of t colors, 
equals 


pT ^2 ( k _ + 1 - \ L 1 I) - 

\t) sre([“ ] ) 

A E (*■■- E k ‘j +l - E '' 

U 5e ( [ ? ] ) hJe(W) /e(W) 

/n5=0 /ns=0 

Jns^0 


yyy Z] ( k ~ ^2 kl ’ J+ ll 


se( [ “ ] ) 


1 - 


/,Je(W) 
/ns=0 

(Vo-r^n 


/,Je( [ “ ] ) 

inS=Jns=® 


/ G (W) 

/ns=0 


G) J V (?) . 

This implies the first part of the claim. The second part follows after putting t = 1. □ 

By combining Lemma 12.81 with Lemma 14.41 we get the following theorem. 

Theorem 4.5. Assume we are given a proper (o : b)-coloring of the host graph, for some 
fixed, a,b. Then, for any t = 1,... ,b there is a Monte-Carlo polynomial space algorithm for 
finding a ( k,l)-tree running in time 


o* 


which never reports a false positive, and reports false negatives with constant probability. 
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Note that t = 1 gives time of 0*( 2 ( 1 ~a) k+ a l f For fixed values of a and b, it is easy to 
find optimal value of t in Theorem 14.51 

The following equivalence is well-known (see e.g. |16]L 

Proposition 4.6 (folklore). Graph G has fractional chromatic number x/(G) < r iff there 
exists a probability distribution it on the independent sets of G such that for each vertex v, 
the probability that v is contained in a random independent set (with respect to ir) is at least 
1/r. 

Proposition 14.61 is interesting because an upper bound B on the fractional chromatic 
number can be proven by providing a polynomial-time randomized algorithm which outputs 
an independent set I such that, for every vertex v, the probability that v € I is at least 1/B. 
This strategy was used e.g. by Ferguson et al. m • It is not hard to show that having such 
an algorithm one can build in randomized polynomial time an (a : 6)-coloring of G such that 
D = B + e for arbitrary fixed e > 0. However, for our applications in (k, l)- tree this is not 
needed as observed in the following lemma (the main observation behind this lemma traces 
back to Bjorklund[2], Theorem 2). 

Theorem 4.7. Assume there is a polynomial-time randomized algorithm A which outputs an 
independent set I such that, for every vertex v of G, the probability that v € I is at least p, 
for some p £ (0,1). Then, there is a Monte-Carlo polynomial space algorithm for finding a 
(. k,l)-tree running in time 

0 *( 2 ( ' 1 ~ p ) k+pl ) 

which never reports a false positive, and reports false negatives with constant probability. 

Proof. (Sketch.) Put r = \(\ — p)k + pk\ + 1. Let Tq be a (k, /)-tree in G. Run the algorithm 
A and let I be the resulting independent set. Put V\ = V \ I and V 2 = I. It is easy to 
see that E(| la(Tc)|) = \V{Tg) fl Li| + \L{Tq) fl V 2 [ = (1 — p)k + pi < r — 1 . By Markov’s 
inequality, Pr(|la(Tf;)| < r) = fi(-) = £7(4). We run the algorithm from Lemma 12.81 and it 
finds a (k,l)- tree with probability H(|). By repeating the whole procedure 0(k) times, we 
can increase the success probability to a positive constant. □ 

4.3 Vector Coloring 

A vector coloring of a graph G on n vertices of value r > 1 is an assignment of unit length 
vectors v u in M n for each vertex u so that for every edge uv, v u ■ v„ < — l/(r — 1). The vector 
chromatic number Xv(G) is the smallest real r admitting a vector coloring. 

Once we have a vector coloring, we can use a random hyperplane through the origin with 
normal h chosen uniformly over the surface of the unit hypersphere to divide the vertices in 
V] and V 2 : For vertex u, we put u in V\ if v u ■ h > 0; otherwise, we put it in V 2 . 

By the linearity of expectation, we can upper bound the expected number of labels as 
the expected number of vertices put in V\ , which is half of them, plus an upper bound of the 
expected number of edges in the tree that have both endpoints in V 2 , which depend on the 
angle between these two vertices’ vectors. The vector coloring value gives us a lower bound 
on the size of this angle. Paraphrasing Lemma 3.2 in Goemans and Williamson [23] . we have 

Lemma 4.8. For a random hyperplane normal h and two vectors v u and v v , we have 

Pr(sign(v u • h) ^ sign(v„ • h)) = - arccos(v u • v v ). 

7r 
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Hence we have 


E(| la(Ib)|) < k -±l + fl - arCC0S (- * 1 / (r - 1 )) ) 

Z y 7T J Z 

By Markov’s inequality and by using that | la(Tc)| is an integer quantity, we see that such a 
random hyperplane gives us at most the expected number of labels with probability at least 
1/(E(| la(Tc)l) +1). If we pick E(| la(Tc')l) +1) hyperplanes independently of each other, with 
probability at least 1 —e _1 one of them will use at most E(| la(T(j)|) labels. Since E(| la(Te)|) 
is a polynomial in fc, we can run Lemma 12.81 for each hyperplane to obtain the following 
algorithm guarantee. 

Theorem 4.9. Assume we are given a vector coloring of the host graph of some value r. 
Then there is a Monte-Carlo polynomial space algorithm for finding a ( k,l)-tree running in 
time 

/ ( k+l , (, arccos( —l/(r—1)) \ fc —1 

O* ( 2V“ + V i * ) — 

which never reports a false positive, and reports false negatives with constant probability. 



5 Corollaries 


5.1 Solving (&, Z)-Tree and fc-IST in Bounded Degree Graphs 

For a graph of bounded degree A that is neither complete nor an odd cycle, it is possible to 
construct a proper A-coloring in linear time (e.g., by Lovasz’s proof of Brooks’ theorem |33]). 
Note that, given a graph that is either complete or an odd cycle, (k , Z)-Tree can clearly be 
solved in polynomial time. In conjunction with Theorem 14.31 this yields the following result. 


Corollary 5.1. There is a Monte-Carlo polynomial space algorithm for (fc, Z)-Tree in graphs 
of bounded degree A which runs in timeO* ^2^ where x = [A+A/MiAzAl - ] _ 


In particular, the running time can be bounded by O* 


With l = 2, the 


same result holds for fc-PATH, and with k = n, for Hamiltonicity. 


In particular, if A < 13 we get fc-PATH and Hamiltonicity algorithms in 0*(1.655 fc ) 
and 0*(1.655 n ) time, respectively, that are faster than the algorithms from 0112]. 

As shown in |44] . a graph G of bounded degree A has a spanning tree with at least fc 
internal vertices iff it contains a tree with exactly fc internal vertices and at most fc — ^~\ = 
^^|fc + 0(1) leavesH Therefore, by the above corollary[3 we can solve fc-IST in time 


O* 


l- 


r(A — 


A(A 


^)(i+&E?)*+(i-i)(&E?)* 


where x = • That is, we can solve 


3 For graphs with a small bounded degree, this bound is better than the one of Strategy A in Section[3] (i.e., 

I < 0.8628fc). Indeed, for A < 8, this bound implies that l < 0.8572 k. 

4 Note that now, in the context of fc-IST, fc is the number of internal vertices, where in Corollary 1 5. 11 it is 
the total number of vertices. 
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Reference 

Variant 

A = 3 

A = 4 

A = 5 

A = 6 

Zehavi (43] 

Directed 

0*(2.51985 fc ) 

0*(2.99662 fe ) 

0*( 3.24901*) 

0*( 3.40267 fc ) 

Corollary 15.21 

Undirected 

0*(2.24493 fc ) 

0*(2.66968 fc ) 

0*(2.87787 fc ) 

0*(3.00355 fc ) 

Raible et al. |38] 

Undirected 

0*(1.96799 n ) 

0*(1.98735 n ) 

0*(1.99376 n ) 

0*(1.99777 n ) 

Corollary 15.21 

Undirected 

0*(1.71449 n ) 

O*(1.80251 n ) 

0*(1.82948 n ) 

0*(1.84227 n ) 


Table 3: Some concrete figures for running times of algorithms for /c-IST in bounded degree 
graphs. 


/c-IST in time O* 


/3A-5l ea;(3A 2 +32: + 2-6A-2a;A) 

' A —1 > t A(A-l)- ' 


. Moreover, the worse running time is ob- 


fA- 3 n ’ which makes our algorithm run in time O* 


tained when k + l = n and l = Therefore, the worse running time is obtained when k = 

,3A-5\ , x(3A 2 +3x+2-6A-2xA) J f A-l 1„\ 

^ A-l > V A(A-l)' 2 > L2A-3J" 1 

r 

. Thus, we also have the following 


which is equal to O* ( 2 - 
result. 


f 3A —5 /x(3A 2 +3z+2-6A-2a:A) N 

\2A-3) ' A(A —1)(2A —3) ' 


Corollary 5.2. There is a Monte-Carlo polynomial space algorithm for /c-IST in graphs of 
bounded degree A which runs in time 


O* (min 


2 L 


^ 3A-5 -, , x(,3A 2 +3x+2-6A-2xA) ^ 

' A —1 ' ^ A(A —l) 2 ' 


k 


/ 3A-5 \ ( x(3A 2 +3x+2-6A-2xA) n 

V2A-3^ v A(A —1)(2A —3) / 


n 


), 


where x 


I 3A 2 -6A+2 i 
L 2(2A-3) I' 


5.2 Subcubic Graphs of Large Girth 

Fractional chromatic number has been intensively studied for graphs of maximum degree 3 
(aka subcubic graphs) of large girth. Perhaps the most important result in this held is due to 
Dvorak, Sereni and Volec m , stating that every subcubic triangle-free graph has fractional 
chromatic number at most 14/5. Unfortunately, their proof does not provide a polynomial¬ 
time algorithm for finding such a coloring. By Lemma 14.41 such an algorithm would imply 
an algorithm for (/c, /)-tree running in time 0*(2^ k+ ^ 1 ) = 0*( 1.562 fc 1.281 z ). However, we 
may use an earlier result of Liu m, which gives a bound of ||. Liu’s proof is inductive and 
can be turned to a polynomial-time algorithm which finds a (516 : 180)-coloring |35J. By the 
second claim of Lemma 14.41 we get the following corollary. 

Corollary 5.3. There is a O*(1.571 k 1.274?)-time Monte-Carlo polynomial space algorithm 
for finding a ( k,l)-tree in a subcubic triangle-free graph. 

Hatami and Zhu m analyzed the following simple algorithm for sampling an independent 
set I: choose a random bijection ir : [n] —> V ( G ), and for every i = 1 ,..., n, include n(i) in the 
independent set / if none of its neighbors is included in /. They show lower bounds pk for the 
probability that any vertex v is included in /, assuming that G is subcubic and has girth at 
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least 2fc+l. In particular, they proved that P3 = 1/2.78571 > 0.3589, p§ = 1/2.67215 > 0.3742 
and py = 1/2.66681 > 0.3749. These bounds, together with Lemma 14.71 imply Monte-Carlo 
algorithms for finding a (A;, i)-tree in a subcubic graphs of girth at least 7, 11 and 15 running 
in times O*(1.5595 fc 1.2830, 0*(1.5431 fc 1.2962 z ) and 0*(1.5423 fc 1.2969 z ), respectively. 


5.3 /c-Path in Undirected Subcubic Graphs 

In this section we reduce the A;- Path problem in subcubic graphs to vertex-weighted A’-Path in 
triangle-free subcubic graphs. This is inspired by a similar technique used by Feder, Motwani 
and Subi m and Eppstein [T4j for finding long cycles, where triangles are removed at a cost 
of introducing weights at edges. By plugging in results from Section lT2l we get an 0(1.571 fc )- 
time algorithm. This is faster than the algorithm running in time 0( 2 2//3fc ) = 0(1.588 fc ) that 
follows from Corollary 15.11 The improvement we get is admittedly modest, but at least it 
shows that the natural barrier of 0( 2 2 / 3fc ) can be broken. 

Let G be a graph and let w : V ->Nbea weight function. Let vertices a, b and c form 
a triangle in G. By G/abc we denote the graph obtained by contracting vertices a, b and c 
into a single vertex t, without introducing multiple edges or loops (this is sometimes called a 
A — Y transform). Let w/abc : V(G/abc) ->Nbea weight function defined by 


w/abc(x) 


w(x) if x / t, 

w(a) + w(b) + w(c) otherwise. 


Lemma 5.4. Let ( G,w ) be an undirected vertex-weighted subcubic graph, and let vertices a, 
b and c form a triangle in G. If (G, w) contains a path of weight k and length d, then 
(G/abc, w/abc) contains a path of weight at least k and length at most d. Moreover, if 
(G/abc, w/abc) contains a path of weight k, then (G,w ) contains a path of weight k. 

Proof. Let P be a path of largest weight in (G, w) among paths of length at most d. If 
{a,b,c} PI V(P) = 0 then P is a path of weight w(P ) in (G/abc, w/abc). Hence assume 
{a,b,c} n V(P) / 0. We claim that there is a path P\ of length at most d in (G,w) such 
that vertices { a,b,c} n V(P) appear consecutively in P. Indeed, if {a,b,c} fl V(P) do not 
appear consecutively in P then for two of them, say for a and b, for some integer p > 1, path 
P contains a subpath av\ ... v p b, such that c 0 {iq,..., v p }. Since G is subcubic, one of the 
vertices a, b is an endpoint of P; assume w.l.o.g. that a is an endpoint of P. Then we set 
Pi = P — {bv p } U {ab}. Note that Pi has the same length as P and if c £ V(Pi), then c is 
a neighbor of b in Pi. Hence Pi is the claimed path. Let t be the vertex of G/abc obtained 
by contracting triangle abc. After replacing cba in Pi by t, we get a path of weight at least 
w(P) in (G/abc, w/abc). 

Consider a path P' in (G/abc, w/abc). If t 0 V (P'), then P' is a path of weight wfabc(P') 
in (G, w). Otherwise it is easy to see that we can obtain a path of weight w/abc(P') in (G, w) 
by replacing t in P' by one of the paths abc, bca or cab. □ 

Corollary 5.5. For a given an undirected subcubic graph G one can build in 0(n) time a 
weighted subcubic triangle-free graph (G',w) such that G contains a path of length k iff G' 
contains a path of weight at least k and length at most k. 

Proof. Assign weight 1 to every vertex and apply Lemma 15.41 as long as the graph contains a 
triangle. □ 
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Now we observe that Lemma [2.81 generalizes to vertex weighted graphs, provided that the 
weights are integral, positive and not too big, similarly as shown in [ 2 ]. 

Lemma 5.6. Let W be a positive integer. Let G be an undirected graph with a vertex-weight 
function w : V(G) —> [W]. Let V(G) = V\ U Vi be a fixed bipartition of the vertex set of the 
host graph G. There is an algorithm running in 0*(2 r W + WlogW) time and polynomial 
space such that 

• If G does not contain a (k, l)-tree of weight at least W, then the algorithm always answers 

NO, 

• If G contains a ( k,l)-tree Tq of weight at least W such that |la(Tc)| < r, then the 
algorithm answers YES with probability at least 

Proof. (Sketch.) We add an additional indeterminate rj in such a way that a vertex x of 
weight w(x) contributes rj w ^ to the monomial. More precisely, for a branching walk B and 
labelling £, we redefine the corresponding monomial as follows. 

mon(B,£) = z h( i) JJ x h{u)Mv) JJ y h (q),e(q) II V w ^ x) \ 

{u,v}eE(T) gela(B) i£ V(T) 

u<v 

It is straightforward to verify that the proof of Lemma 12.21 is still valid in this setting and 
the polynomial evaluation algorithm can be easily adapted. Let us group the terms of Pp. 
according to the exponent at 77 . 

p ir = J2 R pf- 

i> 0 

Our task reduces now to determine whether there is an integer i* > W such that the poly¬ 
nomial Ri* is non-zero. This is done in a standard way, by sampling the values of all inde- 
terminates except for r]. Then, by Schwartz-Zippel Lemma, P ? ;*(x, y,z) 0 with probability 
at least 1 / 2 , provided that we use a field and of size at least 6 k (and still of characteristic 
two). The value of the largest integer i such that Ri (x, y, z) 7 ^ 0 can be determined using 
interpolation in 0(W log IT) time after W evaluations of the polynomial, see e.g. [2] or [5] for 
details. □ 

By combining Corollary 15.51 Lemma 15.61 (for 1 = 2), Lemma 14.41 and Liu’s bound [31] 
similarly as in Section T5.21 we get the following corollary. 

Corollary 5.7. There is a O* (1.57 l fc ) -time Monte-Carlo polynomial space algorithm for find¬ 
ing a k-path in an undirected subcubic graph. 

We note, similarly as we did in Section 15.21 that if there is a (possibly randomized) 
polynomial-time algorithm that finds an a : b coloring for a/b = 14/5, then the above bound 
improves to 0*(1.562 fc ). 

5.4 Solving (k,l)~ Tree in Graphs with Small Vector Chromatic Number 

One of the most striking powers of semidefinite programming is that it is possible to obtain 
a vector coloring of value r = Xv(G) + e for any fixed e > 0 in polynomial time, see for 
instance m- Combining such a vector coloring algorithm with Lemma 14.91 we have the 
following corollary. 
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Corollary 5.8. There is a Monte-Carlo polynomial space algorithm for (A;, /)-Tree that for 
every fixed, constant e > 0 runs in time 

/ ( k + l , ( 1 arccos( — l/(xu(G) + e— 1))\ k-1 

o* ( 2 V~ + V i “ ) — 

With l = 2 the same result holds for fc-PATH and with k = n for Hamiltonicity. 

In particular, if Xv(G) < 8, we get A;- Path and Hamiltonicity algorithms in 0*(1.655 fc ) 
and 0*(1.655 n ) time, respectively, that are faster than the previous algorithms |I]j2]. 

Let us observe that thanks to Theorem SIO we can apply Corollary 15.81 whenever we 
know that a chromatic number or the fractional chromatic number is low, even if no efficient 
algorithm for finding the coloring is known. For example, since Xf < 14/5 for triangle-free 
subcubic graphs m , by plugging it into Corollary 15.81 and using the triangle-removing trick 
from Section [5T3l one gets an 0*(1.576 fc )-time algorithm for /c-Path in subcubic graphs (which 
is slightly worse than the bound of Corollary 15.71 but avoids the use of the complicated Liu’s 
algorithm m)- 
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